Content Security Management
Sensitive Words Management
The sensitive words management feature allows users to define, view, and manage a list of sensitive words. Through this feature, users can control and filter out unwanted content to comply with specific regulatory requirements or community guidelines.
The main functions of the sensitive words management page include:
- Add Sensitive Words: Add new sensitive words to the list to ensure they are identified and handled in future content.
- View Sensitive Words List: Provide an overview of all defined sensitive words, including their activation status, description, and creation time.
- Manage Sensitive Words: Allow users to enable or disable specific sensitive words and update their description information.
Content Review Model
Confidence typically refers to the degree of certainty a model or system has in its prediction results.
In the context of the content review model, confidence threshold settings are an important feature that allow users to define the minimum confidence level a model must reach before marking content as a specific category (e.g., spam, inappropriate content, etc.). This helps reduce false positives or false negatives, depending on the threshold setting.
For example, if the confidence threshold is set to 0.8, the model will only mark content as inappropriate when its confidence in the prediction is at least 80%. This ensures that actions are taken only in cases where the model is highly certain.